19
NLP: 1-2-3
1. Identify Examples: collect examples (typically through manual
record review) of how the concept being sought is documented
in EMR. Hundreds of examples usually necessary for a
comprehensive description.
2. Learn from Examples: based on the examples, create a
language model that can recognize the concept being sought.
This step can be manual (e.g. in rule-based systems) or
automated (in machine-learning-based systems).
3. Evaluate the Language Model: test the language model on a
new set of examples that were not used to create to determine
its accuracy. Several dozen examples typically necessary to
have sufficiently narrow confidence intervals.